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TECHNICAL FIELD 

Disclosed are technologies for regulating gene expression by expression 
of short RNA molecules and methods related to RNA interference. 

BACKGROUND 

RNA interference (RNAi) is a powerful and widely used method to 
inhibit gene product expression in model organisms. RNAi is a highly 
coordinated post-transcriptional mechanism that was first described in 
nematodes. In RNAi, long double stranded RNAs and complex hairpin RNAs 
are processed into small interfering RNAs (siRNAs). These siRNAs are 
generally 21-23 bp RNA duplexes with characteristic dinucleotide overhangs. 
Duplex siRNAs are processed by helicases into single stranded siRNAs, which 
are able to participate in RNA induced silencing complexes (RISC). The RISC 
complex functions as a highly specific endonuclease that usually cleaves target 
RNAs with perfect complementarity to the siRNA in the RISC complex. 

The power of RNAi as a tool lies in two features of the reaction just 
described. First, siRNAs trigger a self-amplifying feedback loop that requires 
only a small number of initial siRNAs to potentially degrade a large number of 
target RNAs. Cleavage of target RNAs by a RISC complex generates additional 
single stranded siRNAs, which in turn are able to participate in additional RISC 
complexes. Second, RNAi exhibits exquisite specificity. A single base pair 
mutation in either the siRNA, or in the target RNA, typically prevents RNAi 
silencing of the target RNA expression. 
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The power of siRNAs has fostered interest in the development of systems 
that can be used for RNAi-mediated silencing of pre-selected target genes in 
mammalian cells. Some systems employ chemical or enzymatically synthesized 
siRNAs to transiently induce RNAi in cells. Other systems use plasmid and 
viral vectors to express hairpin RNAs (siRNA-like transcripts) to stably induce 
the knockdown of expression of pre-selected genes. See, e.g., Brummelkamp et 
aL, Science 296:550-553 (2002) and Novina et aL, Nat Med 8, 681-686 (2002); 
Rubinson et al.,Nat. Genet 33:401-406 (2003). A third class of systems employ 
technologies that allow for conditional expression of siRNA-like transcripts. 
Czaudema et al.,. Nucleic Acids Res 31 :el2 (2003) and Kasim et aL, Nucl. 
Acid. Res. Supp. No 3: 255-256 (2003). 

SUMMARY 

Described herein are new expression systems for inducibly producing 
short RNA transcripts. The short RNA expression systems inducibly produce 
pre-selected short RNA transcripts, for example, short RNA transcripts designed 
to induce gene silencing. In view of the reportedly stringent sequence- 
dependence of certain gene silencing mechanisms, the present systems have the 
advantage of being able to inducibly produce highly specific short RNA 
molecules. Also disclosed herein are transgenic animals and cells carrying the 
disclosed short RNA expression systems. Because the systems of the present 
invention are inducible, they can be used to study the role of essential genes in 
cells and animals in ways that are not possible with constitutive expression 
systems. Additionally, the inducible expression system of the present invention 
can be used to study the effects of induced gene silencing in specific tissues. 

In general, a short RNA expression system typically features a nucleic 
acid molecule that includes the following sequence components: a promoter 
sequence capable of transcribing short RNA transcripts, a short RNA encoding 
sequence that encodes a short RNA transcript, and a STOP cassette (that 
includes a removable termination sequence) disposed between the promoter 
sequence and the short RNA encoding sequence. 

Short RNA transcripts are transcripts with fewer than 400 bases, e.g., 
fewer than 201 bases, fewer than 150 bases, or fewer than 100 bases in length. 
In some examples, short RNA transcripts are transcripts with fewer than 80 
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bases, e.g., fewer than 70 bases, fewer than 60 bases, or fewer than 50 bases in 
length. Short RNA transcripts include RNA molecules capable of eliciting 
RNAi-inediated or micro-RNA-mediated gene silencing. 

A STOP cassette typically includes: a termination sequence capable of 
preventing or terminating transcription by the RNA polymerase that binds the 
promoter sequence. A STOP cassette can be flanked by a first loxP sequence and 
a second loxP sequence. The termination sequence is positioned along the 
nucleic acid between the promoter sequence and the transcription initiation site 
of the short RNA encoding sequence in the nucleic acid molecule. Optionally, 
the short RNA encoding sequence overlaps with one of the loxP sequences. 

In a one aspect, a short RNA expression system is a nucleic acid 
molecule that includes: an RNA polymerase m promoter sequence; a short 
RNA encoding sequence that includes a transcription initiation site; and a STOP 
cassette. The STOP cassette includes an RNA polymerase m-specific 
termination sequence, a first loxP sequence and a second loxP sequence. The 
loxP sequences flank the termination sequence, and the termination sequence is 
disposed between the promoter sequence and the transcription initiation site of 
the short RNA encoding sequence in the nucleic acid molecule. Optionally the 
short RNA encoding sequence overlaps with one of the loxP sequences. 

In some embodiments, the first loxP sequence is a wild-type loxP 
sequence. In some embodiments of the first aspect, the second loxP is a wild- 
type loxP sequence. In certain embodiments of the first aspect, the first loxP 
sequence and the second loxP sequence are mutant loxP sequences. In certain 
examples, the first and second loxP sequences contain one or more mutations in 
the 8 base spacer region of the two loxP sequences. 

In some embodiments, (i) the first loxP sequence, located upstream of the 
termination sequence, is a wild-type loxP sequence, and (ii) the second loxP 
sequence, located downstream from the termination sequence, is a mutant loxP 
sequence. For example, the second loxP sequence can contain sequence that 
overlaps with some or all of the short RNA encoding sequence. In certain 
examples of this embodiment, 1 to 10 nucleotides in the terminal sequence of the 
loxP that is proximal to the short RNA encoding sequence consist of the 5' 
terminal nucleotides of the short RNA encoding sequence. In some these 
examples, the five terminal nucleotides in the loxP sequence consist of the five 
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5 9 terminal nucleotides of the short RNA encoding sequence. The five 5' 
terminal nucleotides of the short RNA encoding sequence is the sequence that 
includes the (+1) through (+5) positions of the transcript encoding sequence. 

In some embodiments, the nucleic acid includes a thymidine nucleotide 
in the sequence position that immediately precedes the upstream terminal 
sequence of the loxP sequence that is located upstream of the termination 
sequence. In some examples of this embodiment, the first loxP is a wild type 
loxP sequence. In some examples of this embodiment, the second loxP 
sequence, located downstream of the termination sequence, is a mutant loxP 
sequence, wherereinl to 10 nucleotides in the terminus of the loxP that is 
proximal to the short RNA encoding sequence consist of the 5' terminal 
nucleotides of the short RNA encoding sequence. 

In some embodiments, the promoter sequence includes some portion of 
the RNA polymerase m promoter sequence from the genomic sequence of the 
small nuclear RNA U6 promoter. Examples of this embodiment include nucleic 
acids with a STOP cassette that includes from 1 to 190 bases of the genomic 
sequence that is immediately downstream of the small nuclear RNA U6 genomic 
transcription termination signal. In another example, the STOP cassette includes 
a modified genomic U6 transcription termination sequence that includes from 1 
to 20 additional thymidine nucleotides disposed immediately adjacent to the 
wild-type U6 thymidine termination signal (or T-stretch); and the STOP cassette 
also includes some number, from 1 to 190, inclusive, of nucleotides from the 
wild-type U6 genomic sequence that is immediately downstream of the 
thymidine termination sequence. In some examples, the termination sequence 
includes more than one T-stretch and also includes some number, from 1 to 190, 
inclusive, of nucleotides encoding the wild-type U6 genomic sequence that is 
immediately downstream of the thymidine termination sequence. Some 
examples of this embodiment also include a wild-type loxP sequence. Some 
examples of this embodiment also include the mutant loxP sequences described 
above, i.e. those in which 1 to 10 nucleotides in the terminus of the loxP that is 
proximal to the short RNA consist of the 5' terminal nucleotides of the short 
RNA encoding sequence. 

In other embodiments of the first aspect, the short RNA encoding 

sequence encodes a transcript with fewer than 400, e.g., fewer than 200, fewer 
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than 100, fewer than 70, fewer than 60, fewer than 50, fewer than 40, or fewer 
than 30 nucleotides. Examples of this embodiment also include one or more of 
the following: any of the promoter sequences, any of the termination sequences, 
the wild-type loxP sequence, or any of the mutant loxP sequences that are 
described herein. 

In another aspect, described herein is a transgenic animal that has 
incoiporated into its genome any of the nucleic acids described herein, for 
example, the nucleic acids described above. 

In one embodiment, the transgenic animal also includes a nucleic acid 
molecule encoding a Cre recombinase. In one example of this embodiment, 
expression of the Cre recombinase is developmentally regulated, e.g., the Cre 
recombinase is maximally expressed only at one or more specific stages of 
embryonic or animal development. In another example of this embodiment, 
expression of the Cre recombinase is tissue-specific, e.g., the Cre recombinase is 
maximally expressed only in one or more specific cell types. 

In some embodiments, the transgenic animal described herein is a mouse, 
a rat, a goat, a pig, a monkey, a cow, a rabbit, a sheep, a hamster, a chicken, or a 
frog. In one example of this embodiment, expression of the Cre recombinase is 
developmentally regulated, e.g., the Cre recombinase is maximally expressed 
only at one or more specific stages of embryonic or animal development In 
another example, expression of the Cre recombinase is tissue-specific, e.g., the 
Cre recombinase is maximally expressed only in one or more specific cell types. 

In another aspect, a cell is described that includes any of the nucleic acids 
described herein, for example, the nucleic acids described above. In one 
embodiment, the cell is an animal cell, e.g., the cell is a mammalian cell. lot 
another embodiment, the cell is an embryonic stem cell. In some embodiments, 
the cell is prokaryotic cell, e.g., a bacterial cell. 

In some embodiments, any cell described herein can also include a 
nucleic acid molecule encoding a Cre recombinase gene. In some embodiments, 
any cell described, herein can also include a Cre recombinase protein. 

In yet another aspect, a method is described for making an inducible 
short RNA expression system. The method includes linking two or more nucleic 
acids to produce any of the new nucleic acids described herein, e.g., the nucleic 
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acids described in the first aspect of the invention (including its embodiments 
and examples). 

In one aspect, a method of making a transgenic animal is described. In 
one embodiment, the method includes introducing into the genome of an 
embryonic stem (ES) cell any of the nucleic acid molecules described herein, 
e.g., the nucleic acids described above, to generate a transgenic ES cell. The 
method also includes introducing the transgenic ES cell into an embryo, 
implanting the embryo into an animal capable of carrying the embryo to term, 
and allowing the embryo to come to term, thereby generating a transgenic 
animal. The method can generate a chimeric transgenic animal, and the method 
can further include crossing the chimeric transgenic animal to another animal of 
the same species to generate a founder transgenic animal. 

In another embodiment, the method includes introducing into the genome 
of an oocyte any of the nucleic acid molecules described herein, e.g., the nucleic 
acids described in the first aspect (including its embodiments and examples). 
The method also includes fertilizing the oocyte to produce an embryo, 
implanting the embryo in an animal capable of carrying the embryo to term, and 
allowing the embryo to come to term, thereby generating a transgenic animal. 

In another aspect, a method is described for making an animal cell 
containing an inducible short RNA expression system. The method includes 
transfecting a cell with any of the nucleic acid molecules described herein, e.g., 
the nucleic acids described in the first aspect (including its embodiments and 
examples). In an example of the method, the transfected cell is a cell from any 
one of the following animals: a human, a mouse, a rat, a goat, a pig, a monkey, a 
cow; a rabbit; a sheep, a chicken, a frog, or a fish. 

In still another aspect, described herein is a method of screening for gene 
function in a cell. The method includes: providing any of the cells described 
herein, e.g., the cells of the third aspect, inducing transcription of the short RNA 
encoding sequence; and monitoring changes in the cell. 

In yet another aspect, a method is described of studying gene function in 
an organism. The method includes: providing any of the transgenic animals 
described herein, e.g., the transgenic animals described in the second aspect of 
the invention, inducing transcription of the short RNA encoding sequence; and 
monitoring changes in the organism. 
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In another aspect, described herein is a method of identifying a candidate 
RNAi effector that has altered expression in T-cells. The new method includes 
administering or inducing expression of siRNA in a T-cell and a control cell, 
evaluating expression of mRNAs or proteins in the T-cell and the control cell, 
and identifying an mRNA or proteins with (a) reduced levels or (b) that is 
differently modified in the T-cell relative to control. Generally, an mRNA or 
protein with reduced levels or that is differently modified in the T-cell relative to 
control is a candidate RNAi effector that has altered expression in T-cells. A 
control cell should not be a mature lymphocyte. 

In further embodiments, the method also includes (a) introducing a 
candidate RNAi effector into or (b) modifying the identified candidate RNAi 
effector in T-cells, and then determining if (a) or (b) increases RNAi efficiency 
in the T-cell. A candidate that increases RNAi efficiency in the T-cell is a 
missing RNAi effector. 

In another aspect, described herein is a method of treating a patient. 
Generally, the method includes administering to a patient a nucleic acid 
molecule that includes an inducible short RNA expression system described 
herein, e.g., any inducible short RNA expression system described above. A 
patient is typically in need of having expression of one or more genes reduced. 
The short RNA expression system will encode a short RNA transcript designed 
to reduce expression of one or more genes that the patient is in need of reducing. 

In yet another aspect, a patient in need of cell-based therapy is 
administered a cell containing a nucleic acid described herein, e.g., a cell 
described that contains an inducible short RNA expression system. 

In another aspect, described herein is a method of identifying a candidate 
inhibitor of RNAi in T-cells. The new method includes administering or 
inducing expression of siRNA in a T-cell and a control cell, evaluating 
expression of mRNAs or proteins in the T-cell and the control cell, and 
identifying an mRNA or protein with (a) increased levels or (b) that is differently 
modified in the T-cell relative to control. Generally an mRNA or protein with 
increased levels or that is differently modified in the T-cell relative to control is 
a candidate inhibitor of RNAi in T-cells. A control cell should not be a mature 
lymphocyte. 
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In further embodiments, the method also includes introducing the 
identified candidate inhibitor of RNAi into a cell and determining if it reduces 
RNAi efficiency in the cell. Generally, if candidate inhibitor reduces RNAi 
efficiency in the cell is an inhibitor of RNAi or if a candidate inhibitor. In other 
further ambodiment, the method also includes inactivating the candidate 
molecule in a T-cell and determining if the inactivation increases RNAi 
efficiency in the T-celL Generally if the inactivation increases RNAi efficiency 
in a cell, then the candidate inhibitor is an RNAi inhibitor in T-cells. 

Terms 

"Short RNAs" and "short RNA transcripts" are ribonucleic acids, 
typically less than 400 bases in length. Some short RNAs are capable of 
eliciting RNAi-mediated or Micro-RNA-mediated gene silencing. 

"Short RNA encoding sequence" is a nucleic acid sequence coding for a 
short RNA transcript. Typically a short RNA encoding sequence will be a DNA 
sequence coding for a short RNA transcript. A short RNA encoding sequence 
can also be an RNA sequence, e.g., in an RNA virus vector, that encodes, e.g., by 
reverse transcription, a short RNA transcript 

'Transcription unit" is a nucleic acid that includes a promoter sequence, a 
transcript sequence, and a transcript termination sequence. 

As used herein to describe a nucleic acid sequence, the term "closer" 
indicates that a first nucleic acid sequence is separated from a second nucleic 
acid sequence by fewer intervening nucleotides in terms of a linear sequence. 
For example, as used herein, a statement that nucleic sequence A (A) is closer to 
nucleic acid sequence B (B) than nucleic acid sequence C (C), means that there 
are fewer intervening nucleotides in the linear sequence between A and B than 
there are intervening nucleotides in the linear sequence between B and C. 

Unless otherwise defined, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the 
art to which this invention belongs. Although methods and materials similar or 
equivalent to those described herein can be used in the practice or testing of the 
present invention, suitable methods and materials are described below. All 
publications, patent applications, patents, and other references mentioned herein 
are incorporated by reference in their entirety. In case of conflict, the present 
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specification, including definitions, will control. In addition, the materials, 
methods, and examples are illustrative only and not intended to be limiting. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 A is a set of diagrams of the U6-STOP-shAl construct (above) 
before a Cre-mediated excision of the termination sequence, and (below arrow) 
after a Cre-mediated deletion of the termination sequence. The mutant loxP 
sequence after Cre-mediated deletion is shown. 

Figure IB is a diagram of the targeting strategy for inserting the U6- 
STOP-shAl construct into the HPRT locus of HM1 stem cells and a Southern 
Blot confirming insertion of the U6-STOP-shAl construct 

Figure 2A is a set of diagrams of the Al-IRES-EGFP reporter construct 
and control EGFP and mutAl-IRES EGFP reporter construct. 

Figure 2B is a series of graphs depicting Cre-dependent inhibition of 
EGFP expression in cells carrying Al-IRES-EGFP reporter construct relative to 
controls. 

Figure 2C is a diagram (left) indicating the locations of polymerase chain 
reaction (PCR) primers used to detect Cre-mediated deletion of the STOP 
cassette in the U6-STOP-shAl construct and (right) three images of 
electropheretic gels containing the indicated PCR products. 

Figure 2D is a Northern blot image indicating siAl expression in the 
indicated ES cell lines. 

Figure 2E is a set of Northern blot images indicating expression of siAl 
sense and antisense strands in mutAl ES cells. 

Figure 2F is a set of Northern blot images indicating expression of GFP 
in the indicated ES cell lines. 

Figure 3 A is a Northern blot indicating expression of siAl in 
lymphocytic cells from indicated transgenic mice. 

Figure 3B is a set electrophoresis gel images loaded with the indicated 
PCR products. 

Figure 4A is a histogram comparing expression of Al mRNA (as a ratio 
of Al to L32 mRNA) in the indicated cell lines from transgenic mice. 

Figure 4B is a histogram indicating expression of Al mRNA (as a 
percentage of wild type levels) in the indicated cell lines from transgenic mice. 
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Figure 5 A is a histogram comparing expression of Al mRNA (as a ratio 
of Al to L32 mRNA) in unstimulated {ex vivo) or stimulated (CD3/CD28) 
indicated cell lines from transgenic mice. 

Figure 5B is a Northern blot image comparing the expression of siAl in 
unstimulated or stimulated indicated cell lines from transgenic mice. 

Figure 6A is histogram and a set of Northern blot images indicating the 
expression of Al mRNA (as a ratio of Al to L32 mRNA, histogram) or 
exression of siAl (Northern blots) in the indicated cell lines. 

Figure 6B is a histogram comparing Al rnRNA (as a ratio of Al to L32 
mRNA) in bone marrow derived macrophages (BMDM) from the indicated 
transgenic mice. 

Figure 6C is a histogram comparing Al rnRNA (as a ratio of Al to L32 
mRNA) in wild-type BMDM and CD19 + splenic B cells. 

Figure 6D is a set of Northern blot images comparing siAl expression 
from the indicated transgenic mouse-derived cell types. 

Figure 7 is a set of Western blot images comparing the expression of 
target proteins in cell lines expressing siRNAs targeted to Itch or Nedd-4. 

DETAILED DESCRIPTION 

The following is a description of specific embodiments of the invention. 
The inducible short RNA expression systems and methods are described in 
conjunction with specific nucleic acid sequences. Nevertheless, it should be 
recognized that the inducible expression system and methods described in the 
present specification and the claims may also be used in conjunction with other 
nucleic acid sequences. Although the inducible short RNA expression systems 
are described as useful in methods that regulate gene expression through RNAi 
and micro-RNA induced mechanisms, it should be recognized that the systems 
are also useful in other methods, e.g. in applications that require the expression 
of short RNAs for purposes other than RNA-mediated gene product regulation. 

Brief Hew of the Novel Expression Systems 

The components of the expression systems include an RNA Polymerase 
m (Pol m)-specific promoter sequence, a /oxP-flanked STOP cassette sequence, 
and a short RNA encoding sequence. These three nucleotide sequences are 
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arranged on a nucleic acid such that the promoter is upstream of the STOP 
cassette, and STOP cassette is upstream of the short RNA encoding sequence. 
The terms upstream and downstream as used herein refer to the direction of 
productive transcription on a nucleic acid molecule starting from the Pol III 
promoter's transcription start site. Productive transcription starts from an 
upstream position on a nucleic acid molecule and proceeds downstream along 
the molecule, until transcription is terminated. Thus, in the present systems, the 
short RNA encoding sequence is downstream of the STOP cassette, and the 
STOP cassette is downstream of the Pol Hi-specific promoter. The relative 
locations of these three components in the present system prevents transcription 
of the short RNA encoding sequence by RNA polymerase III because the STOP 
cassette's termination sequence is located between the Pol III promoter and the 
short RNA encoding sequence. 

When Pol HI polymerase assembles on the Pol m promoter sequence of 
the systems, it proceeds downstream from the promoter sequence towards the 
short RNA transcript. Before it reaches the short RNA encoding sequence, 
though, the polymerase encounters the termination sequence in the STOP 
cassette. The termination sequence causes the polymerase to abort the 
transcription reaction before any short RNA encoding sequence is transcribed. 

Transcription of short RNA transcripts in the systems can be induced by 
causing the nucleic acid to be contacted by a Cre recombinase. Cre recombinase 
can catalyze the excision of the STOP cassette from the nucleic acid, thereby 
producing a nucleic acid that no longer contains a transcription termination 
signal between the promoter sequence and the short RNA encoding sequence. 
Cre-mediated excision of the STOP cassette in the present systems modifies the 
nucleic acids of the systems disclosed herein to allow Pol IH promoter driven 
transcription of the short RNA encoding sequence. 

Detailed view of the Nucleic Acids of the Novel Short RNA Expression Systems 
1. Promoter Sequences 

Promoters that can be used in the short RNA expression system of the 
present invention are nucleic acids that include a promoter sequence capable of 
driving expression of short RNAs, e.g., RNAs which can induce RNAi or micro- 
RNA mediated gene silencing. Preferred promoters are those whose 
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transcription start and stop sites are very predictable and precise. Examples of 
such promoters are the RNA polymerase in (Pol EGQ-specific promoters, which 
include the Pol III type 3 core promoters, which are described in detail in 
Schramm and Hernandez, Genes & Dev. 16:2593-2620 (2002). Pol III promoter 
sequences are DNA sequences that recruit Pol IE, i.e. on which Pol III can 
assemble inside of a cell, for the first step of a Pol III transcription reaction. 

Promoters that can be used in the present invention can include the U6 
snRNA gene (U6) promoter sequence. The U6 gene is transcribed by Pol III and 
encodes the U6 snRNA component of the splicesosome. The U6 promoter 
sequence can be the U6 promoter sequence from a mammal, including a human 
or a mouse, or it can be the U6 promoter sequence from a non-mammalian 
animal. Other Pol HI promoters that can be used in the present invention include 
promoter sequences that drive transcription of the HI RNAse P gene (HI). The 
HI promoter sequence can be the sequence of the HI promoter from a human, a 
mouse, a mammal, or an animal. 

The U6 and HI Pol HI transcription units share several unusual features. 
First, none of the promoter elements, except the (+1) transcription start site, is 
located in the transcribed region of either the U6 or HI gene. This feature means 
that almost any pre-selected sequence can be placed downstream the U6 or HI 
promoter start site, and Pol IH will drive expression of that sequence. Second, 
Pol HI promoters, e.g., the U6 and HI promoters, start transcription from 
precisely defined distances, i.e., between 32 and 25 bp, downstream of the TATA 
box. This feature provides the necessary control for the expression of short pre- 
selected transcripts. Third, Pol HI recognizes a run of 4-5 thymidine residues as 
a termination signal. This feature not only allows for easy control of transcript 
termination, but also results in overhanging uridines, which resembles the 
overhanging uridines or thymidines at the end of synthetic siRNAs. Finally it is 
worth noting that Pol m normally transcribes only very short genes, generally 
less than 400 bp. 

2. The STOP Cassette 

The STOP cassettes of the present invention are nucleic acids. The 
nucleotide sequence of these nucleic acids includes: a transcription termination 
sequence and two loxP sequences. The two loxP sequences flank the termination 
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sequence, i.e., one loxP is positioned at the 5' terminus of the termination 
sequence, (i.e. upstream of the termination sequence) and the other loxP is 
positioned at the 3' terminus of the termination sequence (i.e. downstream of the 
termination sequence). 

The choice of termination sequence used in a STOP cassette will depend 
on the polymerase activity the STOP cassette is designed to terminate. Thus, if 
the promoter sequence used in a system of the present invention is a Pol EI 
promoter sequence, then the termination sequence used in the system is a 
sequence capable of preventing or terminating Pol Dl transcription. If the 
promoter sequence is one that recruits another kind of polymerase, then the 
transcription termination sequence of the STOP cassette is a sequence capable of 
preventing or terminating transcription of that other kind of polymerase that is 
recruited by the promoter. 

The Pol III polymerase is unique in its ability to recognize a simple run 
of four to five consecutive thymidines as a termination signal (T-stretch). See, 
e.g., Schramm and Hernandez, Genes & Dev. 16:2593-2620 (2002). 
Transcription termination can be enhanced by including multiple T-stretches at 
the end of a Pol IH transcribed gene. Transcription termination can also be 
enhanced by increasing the number of consecutive thymidines in a T-stretch. 
Furthermore, reports have also suggested that untranscribed sequence 
downstream of the termination signal can affect the termination efficiency of Pol 
IE termination signal. Das et al., EMBO J. 7:503-512 (1988). 

When a Pol HI promoter is used in a system of the present invention, 
appropriate termination sequences for use in the system can be sequences that 
include a run of four to five consecutive thymidines. The termination sequence 
can optionally include more than 5 consecutive thymidines. The termination 
sequence can optionally include untranscribed downstream sequences from 
known genomic Pol HI termination signals. 

For example, when a Pol III promoter is used in a system of the present 
invention, the termination sequence can include sequences that are downstream 
of the genomic U6 termination signal. The termination sequence can include 
any number, from 50 to 190, of bases of the wild-type genomic U6 sequence that 
is downstream of the U6 gene's T-stretch. 
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Other examples of termination sequences that can be used in conjunction 
with a Pol HI promoter sequence in systems of the present invention can include 
sequences that are downstream of the HI termination signal. The termination 
signal can include any number, from 20 to 190, of bases of the wild-type HI 
sequence that is downstream of the HI gene's T-stretch. 

The loxP sequences in the STOP cassette can include wild-type loxP 
sequences or one or two mutant lox P sequences. Wild-type LoxP sequences are 
34 base pair (bp) sequences that are recognized by the Cre recombinase in 
reactions described more fully below. A wild-type loxP sequence is consists of 
two 13 bp inverted repeats separated by an 8bp spacer region. The loxP 
sequence has been published and is also provided in the Example below. See, 
e.g., Sauer, B., Nucl. Acids Res. 24:4608-4613 (1996). It is worth noting that to 
be functional, a wild-type loxP sequence must be on a double stranded DNA 
molecule. The systems of the present invention are not limited to double 
stranded DNA molecules. For example, the present invention contemplates the 
use of retroviruses that carry sequences coding for a promoter, a loxP -flanked 
terminator sequence, and a short RNA encoding sequence. Such retroviruses 
might be used to insert DNA molecules in the genome of a host, thereby 
generating a functional inducible expression system. The terms "wild-type 
loxP" sequence or "mutant loxP sequence" therefore should also be understood 
to include single stranded DNA sequences and RNA sequences coding for 
functional DNA loxP sequences. 

In some embodiments the expression system of the present invention will 
include one mutant loxP sequence. The mutant loxP sequence can be the loxP 
sequence that is upstream or the loxP sequence that is downstream of the 
termination sequence in the STOP cassette. Some mutant loxP sequences will 
contain one or more mutated bases in the terminal 10 bases of one terminus of a 
loxP sequence. The terminus of a loxP sequence refers to one of the two 5' and 
3' ends of the loxP sequence. Thus every loxP in a STOP cassette contains two 
termini, an upstream and a downstream terminus relative to the direction of 
productive transcription generated by the promoter sequence in the system. The 
terminal 10 bases of a loxP terminus are the ten consecutive bases that constitute 
one of the two termini of a loxP sequence. 
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In some embodiments the mutated loxP sequence will include one or 
more mutant bases in the downstream terminus of the loxP sequence that is 
downstream of the termination sequence. Examples of such mutants are loxP 
mutants are loxP sequences that contain one or more mutation in the 10 bases of 
the downstream terminus. In some examples the mutant downstream loxP 
terminal sequence will overlap with the first 1-10, e.g., 5, bases of the short RNA 
encoding sequence. In other words the downstream terminal sequence, of the 
loxP sequence located downstream of the termination sequence, can include, or 
overlap with, the upstream terminal sequence of the short RNA encoding 
sequence. The usefulness of such mutant loxPs is explained below. 

3. Short RNA Encoding Sequences 

The short RNA encoding sequences of the present invention are nucleic 
acid sequences coding for short RNA transcripts. Short RNA transcripts are 
transcripts consisting of 120 nucleotides or less. Short RNA encoding sequences 
include those that code for siRNA-like hairpins, which can be between 10 and 40 
nucleotides in length. In some systems short RNA encoding sequences encode 
transcripts that are between 15 and 30 nucleotides in length. In some systems 
short RNA encoding sequences encode transcripts that are between 18 and 24 
nucleotides in length. Many short RNA encoding sequences include sequences 
coding for transcripts that can activate a cell's RNAi gene silencing mechanisms. 

Short RNA transcripts also include micro-RNA-like precursors and micro 
RNA-like transcripts. Micro-RNA precursors can be approximately 70 
nucleotides in length. Lee et al., EMBO J. 21:4663-4670 (2002). Processed 
Micro RNAs can be much smaller, e.g., from 10-40 nucleotides long, or 15-30 
nucleotides long, or most frequently between 18-24 nucleotides long. Micro- 
RNAs mediate gene-silencing through a different mechanism than RNAi. 
Unlike siRNAs MicroRNAs are not usually perfectly complementary to their 
targets. Short RNA encoding sequences in the present system include sequences 
coding for transcripts that activate a cells micro-RNA mediated gene-silencing 
mechanisms. 

In keeping with standard molecular biological usage, the first nucleotide 
of the short RNA transcript is encoded by the transcription initiation (+1) site of 
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the short RNA encoding sequence. The transcription initiation site is therefore 
upstream of every other nucleotide in the short RNA encoding sequence. The 
second nucleotide in the short RNA encoding sequence that is transcribed can be 
referred to as the (+2) position, and the third nucleotide in a developing 
transcript is coded for by the (+3) position in the short RNA encoding sequence, 
etc. 

In some embodiments the upstream portion of the short RNA encoding 
sequence overlaps with the closest, i.e., proximal loxP sequence in the nucleic 
acid. (The proximal loxP to the short RNA encoding sequence is the 
downstream loxP relative to the other loxP in the system). In these embodiments 
the downstream terminal sequence of the short RNA encoding sequence- 
proximal loxP sequence is the upstream sequence of the short RNA encoding 
sequence. Stated differently, the downstream terminal sequence of the 
downstream loxP contains the transcription initiation site of the short RNA 
encoding sequence, and optionally includes one or more bases of additional short 
RNA encoding sequence. 

In some embodiments of the system, the 10 terminal bases of the 
downstream terminal of the downstream loxP sequence are also the +1 through 
+10 positions of the short RNA encoding sequence. In other embodiments 5 
terminal bases of the downstream terminal of the downstream loxP sequence are 
also the +1 through +5 positions of the short RNA encoding sequence. 

Termination of transcription of the short RNA encoding sequences is 
achieved by placing a termination signal immediately downstream of the short 
RNA encoding sequence. In the present system, the most downstream portion 
the short RNA encoding sequence will contain the first one, two, or three 
thymidines of the stretch of consecutive thymidines that represents a Pol III 
termination signal. 

Functional Equivalents 

Skilled artisans will recognize that functional equivalents can be used in 
place of certain sequences described herein, in conjunction with the inducible 
expression systems disclosed herein. For example, in one embodiment, a 
functional equivalent can be used instead of the mouse genomic U6 promoter 
sequence provided in Table 2 of Example 1. Functional equivalents of the 
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mouse U6 promoter sequence include sequences that differ by one or more bases 
from tire sequence provided in Table 2 and that retain an ability to recruit RNA 
polymerase III in the first step of a reaction that leads to productive RNA 
transcription. Similarly, the functional equivalent of any other Pol m promoter 
sequence, e.g. the human genomic U6 promoter sequence, the human or mouse 
genomic HI promoter sequences, include sequences that differ by one or more 
bases from the Pol in promoter sequences and also retain an ability to recruit Pol 
III in the first step of a reaction that leads to productive transcription. 

Functional equivalents can also be used instead of genomic sequences 
downstream of a Pol Id termination signal. 

Functional equivalent sequences include those sequences that also have a 
high percentage of identity to the sequences already known to skilled artisans 
and/or those sequences disclosed herein that can be used in conjunction with the 
expression systems of the present invention. Functional equivalents include 
sequences with 99%, 98%, 97%, or any percentage higher than 90%, or any 
percentage higher than 80%, or any percentage higher than 70%, identity to a 
known or disclosed sequence. 

To determine the percent identity of two nucleic acid sequences, the 
sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in one or both of a first and a second nucleic acid sequence for 
optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a preferred embodiment, the length of a reference 
sequence aligned for comparison purposes is at least about 30%, preferably at 
least about 40%, more preferably at least about 50%, even more preferably at 
least about 60%, and even more preferably at least about 70%, 80%, 90%, or 
100% of the length of the reference sequence. The nucleotides at corresponding 
nucleotide positions are then compared. When a position in the first sequence is 
occupied by the nucleotide as the corresponding position in the second sequence, 
then the molecules are identical at that position (as used herein nucleic acid 
"identity" is equivalent to nucleic acid "homology"). The percent identity 
between the two sequences is a function of the number of identical positions 
shared by the sequences, taking into account the number of gaps, and the length 
of each gap, which need to be introduced for optimal alignment of the two 
sequences. 
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The comparison of sequences and determination of percent identity 
between two sequences can be accomplished using a mathematical algorithm. In 
a preferred embodiment the percent identity between two nucleotide sequences 
is determined using the GAP program in the GCG software package (available at 
http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 
50, 60, 70, or 80 and a length weight of 1 , 2, 3, 4, 5, or 6. A particularly 
preferred set of parameters (and the one that should be used if the practitioner is 
uncertain about what parameters should be applied to determine if a molecule is 
within a sequence identity or homology limitation of the invention) are a 
Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, 
and a frameshift gap penalty of 5. 

The percent identity between nucleotide sequences can be determined 
using the algorithm of E. Meyers and W. Miller (CABIOS, 4:1 1-17 (1989)) 
which has been incorporated into the ALIGN program (version 2.0), using a 
PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences described herein can be used as a 
"query sequence" to perform a search against public databases to, for example, 
identify other family members or related sequences. Such searches can be 
performed using the NBLAST and XBLAST programs (version 2.0) of Altschul 
et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to 26493 nucleic acid molecules of the 
invention. To obtain gapped alignments for comparison purposes, Gapped 
BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 
25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., XBLAST and NBLAST) can 
be used. See National Center for Biotechnology Information web site 
(www.ncbi.nlm.nih.gov). 

Methods of Making the Nucleic Acid of the Present Invention 

Techniques and methods for engineering recombinant nucleic acids are 
well known in the art. Examples of such techniques and methods include, 
enzymatic nucleotide restrictions, site directed mutagenesis, and in vitro 
transcription. 
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Methods of Using the Nucleic Acids of the fresent Invention 

The nucleic acids of the present invention can be placed inside living 
cells and organisms. For example, the nucleic acids of the present invention can 
be placed in nucleic acid vectors which are subsequently introduced into hosts 
by a variety of methods which are known in the art, e.g., transformation, 
transfection, electroporation, and liposome delivery. Examples of vectors 
include plasmids, phages, cosmids, phagemids, yeast artificial chromosomes 
(YAC), bacterial artificial chromosomes (BAC), human artificial chromosomes 
(HAC), viral vectors, such as adenoviral vectors, retroviral vectors, and other 
DNA sequences which are able to replicate or to be replicated in vitro or in a 
host cell, or to convey a desired DNA segment to a desired location within a host 
cell. 

Examples of organisms that can be hosts for vectors carrying the nucleic 
acid of the present invention include bacteria, yeast, flies, nematodes, animals 
and mammals. Examples of cells that can be hosts to vectors carrying the 
nucleic acids of the present invention include cells available from the American 
Type Culture Collection (ATCC) (Manassas, VA). 

Transgenic Animals 

In some embodiments of the invention the nucleic acids of the disclosed 
expression system are integrated into the genome of transgenic animals. 
Transgenic animals can be generated by introducing the nucleic acids disclosed 
herein into the germline of an animal. Methods for introducing nucleic acids 
into the germline of animals and generating transgenic animals, e.g. chimeric 
transgenics or founder lines of transgenics, are known in the art. See, e.g., 
Torres, R. M. and Kuhn, R., Laboratory Protocols for Conditional Gene 
Targeting, Oxford University Press, Oxford, U.K. (1997) and Nagy et al., 
Manipulating the Mouse Embryo: A Laboratory Manual (Third Edition) Cold 
Springs Harbor Laboratory Press, Woodbury, NY (2003). The Example 
provided below describes the introduction of a nucleic acid containing an 
inducible SHORT-RNA expression system into mouse embryonic stem cells. 

Additional techniques that can be used to produce the founder lines of 
transgenic animals include, but are not limited to, pronuclear microinjection 
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(U.S. Pat. No. 4,873,191), retrovirus mediated gene transfer into germ lines (Van 
der Putten et al, Proc. Natl. Acad. Sci., USA 82:6148, 1985), gene targeting into 
embryonic stem cells (Thompson et al., Cell 56:313, 1989); and electroporation 
of embryos (Lo, Mol. Cell. Biol. 3:1803, 1983). For a review of techniques that 
can be used to generate and assess transgenic animals, skilled artisans can 
consult Gordon (Ml. Rev. Cytol. 115:171-229, 1989), and may obtain additional 
guidance from, for example: Hogan et al. "Manipulating the Mouse Embryo" 
(Cold Spring Harbor Press, Cold Spring Harbor, NY, 1986; Krimpenfort et al., 
Bio/Technology 9:86, 1991; Palmiter et al., Cell 41:343, 1985; Kraemer et al., 
"Genetic Manipulation of the Early Mammalian Embryo," Cold Spring Harbor 
Press, Cold Spring Harbor, NY, 1985; Hammer et al., Nature 315:680, 1985; 
Purcel et al., Science, 244:1281, 1986; Wagner et al, U.S. Patent No. 5,175,385; 
and Krimpenfort et al., U.S. Patent No. 5,175,384. 

Methods of Inducing the Inducible Expression System 

When the nucleic acids described herein are introduced into eukaryotic 
host cells, the host cell's RNA polymerase HI (Pol HI) is recruited to the Pol HI 
promoter sequence of the nucleic acid. The promoter cannot, however, initiate 
transcription of the short RNA encoding sequence, because of the STOP cassette 
that is located between the Pol III promoter and the short RNA encoding 
sequence. When Pol HI polymerase begins moving downstream of the promoter, 
the polymerase encounters the termination sequence in the STOP cassette and 
aborts transcription before short RNA transcript synthesis begins. 

Induction of short RNA expression in the system described herein is 
achieved by exposing the expression system to a Cre recombinase. The ability 
of Cre recombinase to excise /oxP-flanked sequences of DNA has been 
extensively described. See, e.g., Guo et al., Nature 389, 40-46 (1997) and Lakso 
et al., Proc Nat'l Acad. Sci. USA 89, 6232-6236 (1992). Briefly, Cre 
recombinase recognizes loxP sites flanking a DNA sequence and either excises 
or inverts the DNA sequence between the two loxP sites. Although, loxP 
sequences contain two inverted 13 bp repeats, the 8 spacer nucleotides are not 
palindromic and provide loxP sites with an orientation. Excision occurs between 
two loxP sites oriented in the same direction, while inversion occurs between 
loxP sites that are oriented in opposite directions. ACre-mediated excision 
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reaction removes all the DNA between the two original loxP sites and leaves 
behind one loxP sequence. 

In the present system, a Pol HI termination sequence is flanked by two 
loxP sequences. Thus, in the present system, a Cre-mediated excision results in 
the removal of the DNA that encodes the Pol III termination signal. After 
removal of the termination signal, Pol HI is free to bind the Pol III promoter 
sequence of the expression system and transcribe the short RNA encoding 
sequence that is downstream of the promoter. Having removed the termination 
signal of the STOP cassette, there remains only one loxP sequence between the 
promoter sequence and the short RNA encoding sequence, thereby allowing for 
transcription of the short RNA encoding sequence. 

Optimizing short RNA expression 

In applications such as the synthesis of siRNA-like and micro-RNA-like 
gene silencing, the exact transcript sequence generated by the short RNA 
encoding sequence can be very important. For example, single base pair 
mutations can abolish the ability of a transcript to induce RNAi. It is also 
undesirable to include extraneous sequence in a short RNA transcript, as the 
extraneous sequence can also abolish gene silencing. Therefore a short RNA 
expression system should include features that eliminate unwanted mutations or 
extraneous sequence in the short RNA transcript 

The fact that Cre-mediated recombination leaves behind one 34 base pair 
loxP sequence between the promoter sequence and the short RNA encoding 
sequence can create a problem. Since Pol HI promoters start transcription from 
between 32 and 25 base pairs downstream of the TATA box, it will frequently not 
be desirable to locate the TATA box of the promoter sequence upstream of the 
loxP site that is proximal to the promoter sequence. If the TATA box is placed 
upstream of the promoter-proximal loxP site, then Pol EI transcription site, i.e. 
the (+1) site will be located inside the loxP sequence that remains after a Cre- 
mediated excision. 

This problem can be minimized by taking advantage of the fact that the 
5' end of a loxP site has the following sequence: 5'-Adenine, Thymidine, 
Adenine-3' (ATA). By introducing a thymidine reside immediately upstream of 
the loxP site that is proximal to the promoter sequence, a functional TATA box is 
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produced that will remain after a Cre-mediated recombination event in the 
expression system. 

Nonetheless, transcription can still start within the loxP even though the 
TATA box includes the first three nucleotides of the loxP site. For example, the 
transcription start site of the U6 promoter is 26 base pairs downstream of the 
TATA box. In an inducible expression system modified so that the TATA box 
includes the first three nucleotides of the remaining loxP sequence, a U6 
promoter sequence will cause transcription to begin within the loxP sequence, 
i.e., such a transcript will include sequence encoded by the downstream terminal 
5 bases of the loxP sequence. 

To drive the expression of short RNA transcripts that do not begin with 
the terminal 5 bases of the loxP sequence, the present invention recognizes that 
the loxP sequence that is proximal to the short RNA encoding sequence can be 
mutated, so that after a recombination event, the system expresses short RNA 
transcripts that do not include wild-type loxP sequence. Thus, as shown in the 
Example below, the terminal 5 base pairs of the loxP sequence that is distal from 
the promoter can be mutated to encode the first 5 bases of the desired short RNA 
transcript. The mutation effectively creates an overlap of the mutant loxP 
sequence and the short RNA encoding sequence. The mutation described in the 
Example did not affect recombination efficiency and produced a transcript 
capable of inducing gene silencing. 

This strategy can be generalized and adapted to different promoters and 
different pre-selected short RNA transcript. Once the distance from the TATA 
box to a transcription start site has been determined for a given Pol HI promoter, 
the transcription start site within a remaining loxP in an expression system using 
that promoter can be predicted. The downstream terminal residues of the 
downstream toxP site in the system can then be mutated so that the mutant loxP 
sequence encodes the first one or more bases of a pre-selected short RNA 
encoding sequence, that is the downstream mutant loxP sequence and the 
upstream short RNA encoding sequence overlap. In this manner the system can 
be adapted to produce a variety of exact short RNA transcripts that do not 
necessarily include wild-type loxP sequence. 

Methods of Using the Inducible Expression System 
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The inducible expression system disclosed herein can be used in 
conditional, loss-of-function genetic studies in animals and cells. For example, 
transgenic animals whose genomes incorporate the expression system described 
herein can be crossed with transgenic animals carrying the Cre recombinase gene 
under the control of a temporally or spatially regulated promoter. Temporally 
regulated promoters are developmentally regulated promoters that turn on gene 
expression at specific stages of embryonic or animal development. Spatially 
regulated promoters are promoters that turn on gene expression only in defined 
cellular or anatomical locations, e.g., tissue-specific promoters. Many such 
strains of Cre transgenic mice have been developed that carry a Cre transgene 
under the control of a developmentally-regulated or tissue-specific promoter. 
One notable source of such strains is The Jackson Laboratory, Bair Harbor 
Maine. 

Even a single transgenic mouse line whose genome harbors the inducible 
expression systems of the present invention can be crossed with a variety of 
regulated Cre-expressing transgenic mice to create a variety of double transgenic 
mice, which are suitable for use in many conditional, loss-of-function studies. 
These double transgenic lines can be used to study the effects of knocking down 
expression of a target gene in individual tissues, e.g., to study the effects of 
knocking down expression of a target gene only in neural tissue or only in 
specific cell types. The effect of knocking down the expression of essential 
target genes in adult animals can be studied using double transgenics that contain 
a developmentally-regulated Cre gene that is only expressed in the adult animal. 
Similarly the role of a gene during different stages of development can be 
studied by using different double transgenic mice that carry the same short RNA 
expression construct, but different Cre transgenic constructs that express the Cre 
gene at different stages of development. 

The expression systems described herein can also be used to study the 

effects of knocking down multiple gene products expressed by multiple genes, 

which share some genetic sequence identity. For example, an expression system 

coding for only one siRNA-like molecule can be used to down regulate 

expression of more than one gene product, provided those genes share an 

identical siRNA target sequence. Thus, a single nucleic acid expression system, 

or an organism or a cell carrying one such nucleic acid, of the present invention 
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can be used to study the role of gene products from multiple gene family 
members, provided each member of the gene family shares some sequence 
identity with the other gene family members at the target site of for the short 
RNA that is inducibly expressed by the nucleic acid. The Example provided 
below discloses an expression system designed to produce a single short RNA 
transcript that down regulates several members of the Al group of genes in the 

bcl-2 family of genes. 

The expression system of the present invention can also be used in 
conjunction with other methods of conditionally delivering Cre recombinase to 
animals or cells harboring the nucleic acids disclosed herein. For example, cells 
transformed or transfected with the expression system can be exposed to 
exogenous Cre recombinase. The Cre protein can be delivered into the cells 
using any reagent suitable for the delivery of protein into a cell, e.g., liposomes 
or electroporation. Delivery of the Cre protein into the cell can thereby induce 
the recombination event that allows expression of the short RNA encoding 
sequence. 

The inducible short RNA expression system disclosed herein is a 
powerful tool for conducting conditional loss of gene function experiments. 
Animals or cells harboring the nucleic acids disclosed herein can be induced to 
express the short RNA coded for by the nucleic acids, and changes in these 
animals or cells can be monitored. The types of changes that can be monitored 
include, but are not limited to, physiological changes, molecular biological 
changes, biochemical changes, changes in genetic expression, histological 
changes, gross anatomical changes, behavioral changes, changes in viability, 
changes in morbidity, and changes in mortality. Other changes that can be 
monitored include changes in compound-mediated effects on a cell or on an 
organism, e.g., changes in drug efficacy and/or changes in any other drug- 
induced effect or side effect. 

Screen for Factors that Affect RNAi Efficiency in Mature Lymphocytes 

As shown in Example 2, below, mature lymphocytes display a less 
efficient RNAi-mediated knockdown of target genes, despite the presence of 
siRNA-RISC complexes. Therefore, these cells either (a) lack or have reduced 
activity of a downstream effector of the RISC complex or (b) contain inhibitors 
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of RNAi-mediated activity. Downstream effectors of RISC may include proteins 
with endonuclease activity, such as ago2 or a "slicer" protein. Other 
downstream effectors include factors that mediate RNAi-mediated translational 
silencing. 

Downstream effectors of the RISC complex with reduced activity in 
mature lymphocytes, i.e., B-cells or T-cells are typically effectors that (a) have 
reduced expression or (b) are differently modified (e.g., phosphorylated, 
glycosylated, truncated) in mature lymphoctes. Candidate downstream effectors 
with reduced expression or differential modification in mature lymphocytes can 
be evaluated at the mRNA (e.g., using real time RT-PCR, RNAse protection, 
differential display, or gene chip assays) and/or protein (e.g., by Western blot, 2D 
SDS PAGE, immunoprecipitation, or peptide chip assays) level. Expression 
level or modification in mature lymphocyte are then compared to another cell 
type, e.g., a thymocyte. Lower levels or altered protein modification (e.g., 
phosphorylation or glycosylation) of the downstream effector can indicate that it 
is a missing downstream effector of RNAi in T cells or B-cells. 

Downstream effectors with reduced activity can be added to mature 
lymphocytes in order to increase the efficiency of RNAi-mediated in the cells. 
Therefore, a screen for missing effectors is useful in the identification of (a) co- 
therapeutics that can be used in conjunction with siRNA-dependent therapeies 
and/or (b) as a research tool to enhance RNAi in mature lymphocytes of animals 
or in cultured mature lymphocytes. 

Inhibitors of RNAi in mature lymphocytes, i.e., B-cells or T-cells, can 
also be identified using comparative assays of mRNA (e.g., using real time RT- 
PCR, RNAse protection, differential display, or gene chip assays) and/or 
comparative level protein assays (e.g., by Western blot, 2D SDS PAGE, 
immunoprecipitation, or peptide chip assays). Expression level(s) in mature 
lymphocyte are then compared to another cell type, e.g., a thymocyte. Increased 
expression or altered protein modification (e.g., phosphorylation, glycosylation, 
truncation) of a peptide can indicate that it is an inhibitor of RNAi in mature 
lymphocytes. 

Screens for mature lymphocyte inhibitors of RNAi are useful for the 
identification of therapeutic targets and/or new research tools. Successfiil 
targeting and inactivation of mature lymphocyte inhibitors of RNAi can facilitate 
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the therapeutic use of siRNAs in mature lymphocytes, e.g., B-cells or T-cells. 
Thus, for example, small molecule antagonists or antibodies that bind to mature 
lymphocyte inhibitors of RNAi are useful as (a) co-therapeutics in conjunction 
with siRNA-dependent therapies or (b) to enhance RNAi in animal models or 
cultured cells. Mature lymphocyte inhibitors can also be used to suppress RNAi 
activity in cells, e.g., to prevent unwanted side effects of siRNAs in healthy 
tissues. 

EXAMPLES 

The invention is further described in the following examples, which do 
not limit the scope of the invention described in the claims. 
Example 1 

An exemplary short RNA expression system was built and tested for its 
ability to inducibly initiate RNAi in vitro. This short RNA expression system, 
referred to herein as the U6-STOP-shAl construct, was inducibly produce short 
hairpin RNAs (shRNAs) that target the Ala, Alb, and Aid genes of the bcl-2 
family. Generally, shRNAs are short hairpin RNAs that can be degraded to 
siRNAs that activate RNAi. The U6-STOP-shAl construct was inserted into the 
genome of mouse embryonic stem cells, shRNA transcription was induced, and 
the construct was found to selectively knockdown the expression of an Al -fusion 
reporter gene. 

TIte U6-STOP-shAlconstruct 

The construct used in this example included, a U6 promoter sequence, a 
/oxP-flanked STOP cassette, and an shRNA sequence. The STOP cassette 
included the U6 transcription termination sequence. The U6 termination 
sequence consisted of the wild-type run of consecutive thymidines, i.e. the T- 
stretch, and 190 bp of genomic DNA downstream of the T-stretch. An additional 
T-stretch was inserted next to the endogenous U6 T-stretch to enhance the 
efficiency of transcriptional termination of the STOP cassette. Insertion of the 
/OTP-flanked STOP cassette between the U6 promoter and the shRNA gene 
required several adjustments in order to ensure proper shRNA transcription upon 
Cre-mediated deletion of the STOP sequence. 

Transcriptional initiation at the transcription initiation start site (+1) is 
crucial for the precise generation of short RNAs by RNA Pol ID . Deletion of 
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the STOP cassette leaves only one loxP site at the site of Cre-mediated loxP 
recombination. If the STOP cassette were inserted after the (+1), this would 
result in a /oxP-shRNA fusion transcript, which could interfere with proper 
shRNA processing and siRNA generation. To avoid transcription of the loxP 
site, it had to be integrated into the U6 promoter between the TATA box and 
(+1). Mutational analysis of the Pol III promoter suggested that this sequence 
could be altered without affecting the efficiency of Pol m -mediated 
transcription. See, e.g., Myslinski et al., Nucleic Acids Res. 29:2502-2509. 
(2001). However, since the (+1) site is located 26 bp downstream of the U6 
TATA box and one loxP site comprises 34 bp, accommodation of a loxP site in 
the U6 promoter required the following adjustments: the first 3 bp of the loxP 
site (ATA) were integrated into the TATA box and the last 5 bp of the shRNA- 
proximal loxP site was exchanged for the first 5 bp of the shRNA coding 
sequence. A 5 bp mutation at the distal end on the inverted repeat was not 
expected to dramatically decrease recombination efficiency. Figure 1 A shows a 
schematic view of the inducible construct. 

The entire construct is referred to as the U6-STOP-shAl cassette or U6- 
STOP-shAl construct. The shRNA sequence is referred to as, shAl, since it is 
directed against the bcl-2 family members Ala, Alb and Aid. Upon expression 
of shAl, the RNAi-processed RNA transcript produced is referred to herein as 
siAl. 

The U6-STOP-shAl cassette was cloned in three steps. First, the 
modified U6 promoter was PCR amplified from the U6 promoter containing 
plasmid P U6 (Sui et al, Proc. Nat'l Acad. Sci. USA 99:5515-5520 (2002) using 
primers Xbal-U6 and U61ox-T-RI (see Table I). The 5' primer introduced an 
Xbal site 5' of the U6 promoter, the 3' primer replaced the sequence 3' of the 
TATA box with a loxP site, two T-stretches, and an EcoRI site. Second, the Pol 
III termination sequence was PCR amplified from C57BL/6 genomic DNA using 
primers U6termRI and U6termlB (see Table I), which introduced a 5'EcoRI site 
and a 3' BamHI site. Third, a fragment consisting of a mutant loxP site fused to 
shAl was generated by oligonucleotide synthesis of two complimentary 
oligomers, lox-shAl-s and lox-shAl-as (see Figure lAfor sequence 
information). The annealed oligomer contained a 5' BamHI site and a 3' Hindm 
site. The three subfragments from each of the steps listed above were cloned 
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into a modified pBS-polylinker resulting in an Ascl-flanked U6-STOP-shAl 
construct. The sequence of the U6-STOP-shAl construct is shown in Table 2. 
Table 1 Primers for polymerase chain reaction (PCR) 



Name 


Sequence (S'-S') 


Location 


-> 


LAR53 


nn a cctcc a TcrnrTnTTATTT 


5* of DQ52 


s* 


CDR3-PE 


GGTCTATTACTOTOL. AAO 1 1 uu 


CDR3 of VPE 


as 


U6termRI 


TGTGAATTCGTTCCTCAGAGGAACTGA 


J 01 uo gene 




U6terml-B 


TGTGGATCCCCCGGGCGTGGCTTGGTGGTACACCTC 


J oi gene 


as 


Xbal-U6 


GACTCTAGATCCGACGCCGCCATCTCTAG 


uo prornoivi 


s 


U61oxTrRI 


TGCGAATTCAAAAATCGCAAAAACGTAATAACTTCGTATA 
AGTATGCTATACGAAGTTATAGTCTCAAAACACACAATTA 
CTTAC 


U6 promoter 


as 


Ald-X 


TGCTCGAGATGTCTGAGTACGAGTTCATGCATATC 


AldcDNA 


s 


mutAl-B 


CTGGATCCTTATTTCAGCAGGAACAGCATCTCCCATATCTG 


AldcDNA 


as 


Ald-B 


CTGGATCCTTACTTGAGGAGAAAGAGCATTTC 


AldcDNA 


as 


HPRT-SAH 


TTCCTAATAACCCAGCCTTTG 


pMP-tOSAH 


s 


hHPRT-pro 


GTGATGGCAGGAGATTTGTAA 


hHPRT promoter 


as 


Abbreviations in Table 1: s, sense strand; as, antisense strand 



Table 2 Sequence of the U6-STOP-shAl construct 

5'- 

tccgacgccgccatctctaggcccgcgccggccccctcgcacagacttgtgggagaagctcggctactcccctgc 
cccggttaattt 

gcatataatatttcctagtaactatagaggcttaatgtgcgataaaagacagataatctgttctttttaatactagctacat 

tttacatgataggcttggatttctataagagatacaaatactaaattattattttaaaaaacagcacaaaaggaaactca 

ccctaactgtaaagtaattgtgtgttttgagactataacttcgtatagcatacattatacgaagttattacgtttttgcgattt 

ttgaattcgttcctcagaggaactgacaagcaccctaacatcctattggaggctcactcacgttttttctattttgtttcttg 

acagcagagctcgttgctcactgtatagctcaggttggcctgacactgatgaggttctccagtgactgcctctaccta 

cctactgggatgacagaggtgtaccaccaagccacgcccgggggatccataacttcgtatagcatacattatacga 

aggaaatgctctttctcctcaaagctttgaggagaaagagcatttcccttttt-3' (SEQIDNO: 1) 

The nucleotide sequence of the U6-STOP-shAl construct in Table 2 encodes the 
following functional units (numbering begins at the 5' end): 
Bases 1-282, which include the U6 promoter sequence upstream of the TATA 
box, are 
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tccgacgccgccatctctaggcccgcgccggccccctcgcacagacttgtgggagaagctcggctactcccctgc 
ccc 

ggttaatttgcatataatatttcctagtaactatagaggcttaatgtgcgataaaagacagataatctgttctttttaatact 

agctacattttacatgataggcttggatttctataagagatacaaatactaaattattattttaaaaaacagcacaaaagg 

aaactcaccctaactgtaaagtaattgtgtgttttgagac (SEQ ID NO: 2). 

Rases 283-287 are the TATA box, i.e, tata (SEQ ID NO: 3). 

Bases 284-317 include the first loxP site upstream of the STOP cassette which is 

a wild type loxP sequence, i.e., ataacttcgtatagcatacattatacgaagttat (SEQ ID NO: 

4). 

Bases 318-530 is a STOP cassette, which includes aU6 Pol m termination 
sequence termination sequence supplemented with an additional 4 thymidine 
stretch (underlined in the following sequence), i.e., 

tacgtttttgcgatttttgaattcgttcctcagaggaactgacaagcaccctaacatcctattggaggctcactcacg 
ttttttctattttgtttcttgacagcagagctcgttgctcactgtatagctcaggttggcctgacactgatgaggttctcca 
gtgactgcctctacctacctactgggatgacagaggtgtaccaccaagccacgcccgggggatcc (SEQ ID 

NO: 5). 

Bases 331-537 include genomic U6 Poim termination sequence, i.e., 
tttttgaattcgttcctcagagg 

aactgacaagcaccctaacatcctattggaggctcactcacgttttttctattttgtttcttgacagcagagctcgttgct 
cactgtatagctcaggttggcctgacactgatgaggttctccagtgactgcctctacctacctactgggatgacagag 
gtgtaccaccaagccacgcccgggggatcc (SEQ ID NO: 6). 

Bases 543-577 is the mutant second loxP site downstream of the STOP cassette; 
i.e., ataacttcgt 

atagcatacattatacgaaggaaa (SEQ ID NO: 7). 

Note that the five 3' terminal nucleotides of SEQ ID NO 6 are the 5' terminal 
sequence of the shAl transcript. 

Bases 572-623 includes the shAl hairpin transcript plus a 3' T-stretch. 

Figure 1 A shows diagrams of (top) the U6-STOP-shAl construct before 
a Cre-mediated excision of the termination sequence, and (bottom) the U6- 
STOP-shAl construct after a Cre-mediated deletion of the termination sequence, 
the latter is referred to as U6A-shAl. Triangles are loxP sites, the STOP 
rectangle is a modified U6 termination sequence comprising two T-stretches and 
190 bases of wild-type genomic sequence immediately downstream of the 
genomic U6 T-stretch, and U61ox is a modified U6 promoter sequence 
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containing a loxP site. Sequence from TATA box to the T-stxetch following 
shAl sequence is shown below U6A-shAl. Below the diagrams. Figure 1 A 
shows the sequence between the TATA box and the transcriptional termination 
site of shAl . Also shown is the overlap between TATA and the 5' end of the 
loxP sequence (underlined), as well as the overlap (double underlined) between 
the upstream (5') 5 bp of the shAl encoding sequence and shAl proximal 
terminus of the mutant loxP. The distance from the 3' end of the TATA box to 
the shRNA transcription initiation site (+1) is 26 bases. 

Insertion of the Construct into HPRT deficient HM1 Embryonic Stem Cells 

As a first step in generating transgenic mouse strain that allows 
ubiquitous induction of shAl -mediated RNAi upon Cre-mediated recombination 
in a defined genetic locus, the U6-STOP-shAl construct was targeted into the X- 
linked hypoxanthine phosphoribosyltransferase (HPRT) locus by homologous 
recombination in ES cells. This approach takes advantage of the fact that 
HPRT-deficient HM-1 ES cells permit extremely efficient selection of 
transgenes inserted into the HPRT locus. Thompson et al., Cell 56:313-321 
(1989). HM-1 ES cells lack the HPRT promoter and exons 1 and 2. Only 
reconstitution of the disrupted HPRT locus by gene targeting confers resistance 
to HAT selection. Hence, virtually every HAT-resistant ES cell colony carries 
the targeted HPRT allele. A targeting vector that allows the insertion of 
transgenes into HM-1 ES cells has been described previously (pMP-8SKB, 
(Bronson et al., 1996)). A modified version of this vector referred to as pMP-10, 
has been developed, which can be linearized with Swal, Sbfl or Sgfl and harbors 
two additional unique restriction sites (AscI and Pmel) to insert the transgene of 
choice. The U6-STOP-shAl cassette was inserted into the AscI restriction site 
of pMP-10 in the same transcriptional orientation as the HPRT gene. The 
targeting vector was linearized with Swal and transfected into HM-1 ES cells. 

The targeting strategy is shown in Figure IB. The left side of Figure IB 
shows a partial restriction map of the HPRT wild-type genomic locus (HPRT 
WT), below which is a partial restriction map of the HM1 mutant HPRT 
genomic locus (HM1), and below both of which, is a partial restriction map 
showing the insertion, i.e. Knock-In, of the U6-STOP-shAl construct into the 
HM1 mutant HPRT locus (U6-STOP-shAl KI). HPRT exons are shown as 
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boxes with roman numerals above them. Stul restriction sites are marked by a 
capital S. 

The right hand of Figure lb shows a Southern blot confirming insertion 
of the U6-STOP-shAl construct. The integrity of HAT-resistant colonies was 
confirmed by Southern blotting using a Stul digest and probe RSA. RSA is 
shown in Figure 2a above the general location of its binding site near HPRT 
exon III. Two independent ES cell clones were injected into C57BL/6 
blastocysts. 

Testing the Construct 

The ability of the U6-STOP-shAl construct to effect Cre-mediated 
induction of shRNA expression and subsequently knock down Al expression 
was tested in transgenic ES cells. Endogenous Al expression is barely 
detectable in ES cells. Therefore, to increase measurable Al signal, a transgene 
encoding an Al-IRES-EGFP reporter protein was introduced into targeted ES 
cells. Al cDNA was fused to DNA containing an internal ribosomal entry site 
(IRES) followed by EGFP cDNA. Expression of this fusion construct resulted 
in a bicistronic mRNA encoding Al and EGFP. Degradation of this construct by 
siAl -mediated mRNA degradation was predicted to result in loss of both Al and 
EGFP expression. 

The coding sequence of the mouse Aid gene was PCR-amplified from 
splenic cDNA usingprimers Ald-X and Ald-B (see Table I), which introduced 
into pIRES2-EGFP (Clontech, Palo Alto, CA) to generate the Al-IRES-EGFP 
fusion construct depicted in Figure 2 A. A second, mutated Al expression 
construct was generated by PCR amplification using primers Ald-X and 
mutAld-B (see Table I) to introduce 6 conservative mutations at the siAl target 
site. The PCR fragment of both Aid and mutA were then subcloned into 
BamHI/XhoI-digested pIRES2-EGFP to generate the fusion constructs depicted 
in Figure 2A. 

Al-IRES-EGFP, mutAl -IRES-EGFP and IRES-EGFP fragments were 
excised from the respective pIRES2-EGFP vectors using Xhol andNotl and 
inserted into an Xhol site 3' of the chicken b-actin promoter in the pCXN2 
vector. This vector contains a neoR selectable marker. Niwa et al., Gene 
108:193-199 (1991). Expression vectors were Sail-linearized and transfected 
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into U6-STOP-shAl ES cells. Stable integrants were selected with G418 
starting 2 days after transfection. Single G418-resistant ES cell colonies were 
analyzed for EGFP expression in order to confirm expression of the reporter 
transgene. 

The three fusion constructs depicted in Figure 2A were used to verify 
specific RNAi-mediated gene silencing by shAl. Al box represents the Al 
cDNA sequence, the IRES box represents the internal ribosome entry site 
sequence, EGFP box represents the EGFP gene, and pA box represents the 
polyadenylation (poly A) site from the pCNX2 expression vector. The mutAl 
box represents the mutated Al cDNA; gray letters in the sequence below the 
mutAl box indicate mutated bases. The siAl box represents predicted product 
of RNAi processed shAl transcript, the siAl box is depicted above the siAl 
target site. 

Cre-mediated induction of RNAi in ES cells 

EGFP + clones of each transgenic ES cell line were transduced with a Cre 
expressing adenovirus in order to delete the U6 Pol HI termination sequence of 
the Z^P-flanked STOP cassette and thereby induce shRNA expression. See, 
e.g., Bassing et al, Cell 109 Suppl:S45-55 (2002). Transduced U6-STOP-shAl 
ES cells carrying the deleted STOP cassette are also referred to as U6A-shAl 
cells. Untransduced cells served as negative control. Seven days after 
transduction, ES cells were analyzed for EGFP expression by FACS analysis. 

Figure 2B depicts FACS analysis of EGFP expression in transduced 
(open histograms) or untransduced ES cells (shaded histograms). The respective 
EGFP transgene is indicated, and AV-Cre stands for Cre expressing adenovirus. 
The results show that only ES cell clones that were exposed to Cre and carried 
the perfectly complementary AMRES-EGFP transgene showed downregulation 
of EGFP expression, demonstrating sequence-specific and inducible RNAi in 
U6-STOP-shAl ES cells. 

The observed downregulation of EGFP in only -60% of cells likely 
reflects incomplete deletion of the STOP cassette. This was confirmed by PCR 
analysis of genomic DNA isolated from total cell lysate or subpopulations that 
were sorted according to EGFP expression levels. PCR analysis strategy can be 
appreciated by reference to Figure 2C. Figure 2C is a diagram of the targeted 
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HPRT locus. Half-arrows depict primers hHPRT-pro and HPRT-SAH (see 
Table I) flanking the inserted U6-STOP-shAl cassette. The bent arrow 
represents the human HPRT promoter, the gray box depicts human exon 1, the 
white box mouse exon 2; map is not drawn to scale. Right hand panels of Figure 
2C show PCR results for transduced U6A-shAl and untransduced ES cells 
transgenic for IRES-EGFP (IRES), Al-IRES-EGFP (Al) or mutAl-IRES-EGFP 
(mutAl). Al-IRES-EGFP transgenic ES cells were sorted according to EGFP 
expression levels. DNA from EGFPhigh cells and EGFPlow cells was subjected 
to PCR. The expected sizes for PCR fragments before (U6-STOP-shAl) and 
after deletion of the Pol III STOP cassette (U6A-shAl) are indicated. The 
asterisk indicates a fragment resulting from a heterozygous ES cell with one U6- 
STOP-shAl strand and one U6A-shAl strand. Figure 2C shows that deletion of 
the STOP cassette was incomplete in the bulk sample and was exclusively 
detected in GFPlow cells. 

Importantly, Figures 2D, 2E, and 2F show that similar levels of Cre- 
mediated deletion and concomitant siRNA generation were detected in all Cre- 
treated ES cell lines, emphasizing the specificity of siAl for Al-IRES-EGFP 
mRNA. To determine the extent of mRNA degradation, EGFP containing 
mRNA levels were analyzed by Northern blotting using a probe specific for 
EGFP. 20 mg of total RNA were loaded per lane. The results of Northern blot 
analysis of transduced and untransduced ES cells carrying the indicated 
transgene are shown in Figures 2D, 2E and 2F. Synthetic double-stranded 
siRNA of identical sequence were loaded in the amounts indicated above the 
siAl lanes of Figure 2D and 2E to estimate siRNA expression levels. 

Detection of GAPDH mRNA served to normalize for loading 
differences. Detection of processed 21-23 nt siRNA by Northern blotting is 
generally restricted to RISC incorporated siRNA molecules, since single- 
stranded siRNA molecules that are not RISC-protected are rapidly degraded. 
Indeed, Figure 2E shows an inability to detect processed siAl anti-sense (as) 
strand, indicating that our assay predominantly detects functional, RISC 
protected siRNA. 

To determine the extent of mRNA degradation, (Al)IRES-GFP mRNA 
levels were analyzed by Northern blotting using a probe specific for GFP. 
Results are shown in Figure 2F. The size of the mRNA differed depending on 
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the presence or absence of the Al cDNA. EGFP mRNA levels were strongly 
reduced in total cell lysate and the remaining mRNA is likely to originate from 
cells that have not undergone deletion of the STOP cassette. Indeed, when cells 
were sorted according to GFP expression, Al-ERES-GFP mRNA was barely 
detectable in GFPlow cells. Image quantification showed a >10 fold reduction 
of mRNA when compared to GFPhigh cells. No mRNA reduction was observed 
in untransduced Al-IRES-GFP transgenic ES cells nor in 3RES-GFP control 
samples. These data demonstrate that a single copy of the U6-STOP-shAl 
cassette mediates efficient, sequence-specific and tightly regulated suppression 
of Al in vitro. 

Example 2 

To evaluate the ability of an shRNA expression system to generate RNAi 
in vivo and in a tissue specific maimer, a number of transgenic mice were 
developed using ES cells described in Example 1. ES cells carrying aU6- 
STOP-SHA1 allele were crossed with a mouse strain that expresses Cre during 
early embryogenesis to generate germline U6A-shAl transgenic mice (U6A- 
shAl mice). These constitutive shAl expressing mice were viable and 
phenotypically indistinguishable from wild-type littermates. U6-STOP-SHA1 
ES cells were used to generate U6-STOP-SHA1 transgenic mice. U6-STOP- 
SHA1 transgenic mice were crossed with CD19-cre mice that express Cre 
exclusively in B-cells by placing the cre gene under control of the CD 19 B- 
lineage cell-specific promoter as described in Rickert et al., Nucleic Acids Res., 
25:1317-8 (1997). The mice resulting from this cross are referred to herein as 
U6-STOP-SHAl/CD19-cre mice. Expression of siAl in lymphocyte subsets 
from different mice was evaluated by Northern blotting as shown in Figures 3 A. 

Expression of siRNA is Tightly Dependent on Induction by Cre In Vivo 

Figure 3 A, lane 1, shows that U6A-shAl mouse-derived thymocytes 
robustly expressed siAl. Based on the siAl loading control and the fact that 
lymphocytes express 1-2 ug total RNA per 10 6 cells, the Northern blot indicated 
that thymocytes expressed 500-1000 copies of processed siAl per cell. Figure 
3 A, lane 2, on the other hand, shows no significant expression of siAl was 
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detected in thymocytes from U6-STOP-SHAl/CD19-cre mice, which is 
expected since CD-19-cre mice do not express Cre in thymocytes. 

Generally, the data indicate that (i) an induced shRNA expression system 
can generate siRNA in vivo, and (ii) that uninduced cells carrying an shRNA 
expression construct are tightly regulated and, thus, do not express siAl in 
tissues where Cre is not present 

An shRNA Expression System Induced Efficient Tissue-Specific Deletion of 
STOP Casette and siRNA Production 

A comparison of lanes 2, 5, and 6 of Figure 3A show that expression of 
siAl was restricted to B cells (lanes 5 and 6) from U6 STOP shAl/CD19-cre 
mice and no detectable siAl expression occurred in thymocytes (lane 2) from the 
same transgenic mice. These results indicate that since deletion of the loxP- 
flanked STOP cassette is strongly dependent on Cre expression, tissue specific 
Cre expression can be used to control tissue-specific siRNA production using an 
shRNA expression system. 

A semi-quantitative PCR assay was used to asses the efficiency of 
deletion of the STOP cassette. PCR primers shown in Figure 2C were used (as 
described in Example 1) to amplify genomic DNA from bone marrow and 
splenic B cells derived from U6 STOP shAl/CD19-cre. PCR products from 
these reactions are shown in the right hand panel of Figure 3B. Predicted size of 
PCR product from cells in which the STOP cassette was deleted (U6A-shAl) or 
remained intact (U6-STOP-shAl) are indicated. The left panel of Figure 3B is a 
reference standard showing a titrated series of mixtures of PCR products from 
U6A-shAl and U6-STOP-shAl mice. Amount of U6A-shAl PCR product as 
percentage of each titration mixture is indicated above the left panel. 

The two panels in Figure 3B indicate that the STOP cassette was deleted 
with an estimated efficiency of -90% in mature B cells. 

Taking into account the incomplete deletion of STOP cassette in U6 
STOP shAl/CD19-cre mice, siAl expression in B cells from U6 STOP 
shAl/CD19-cre mice was comparable to those observed inU6A-shAl B cells 
(compare lanes 3 an 4 with lanes 5 and 6 of Figure 3 A, and note that half the 
amount of RNA was loaded in lanes 5 and 6). As a consequence of the defined 
genetic location of a single siAl expression cassette, siAl expression levels were 
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reproducible between different animals (Figure 3A, lanes 3, 4 and 5, 6) in both 
U6-STOP-shAl; CD19-cre mice and in U6A-shAl mice. 

Generally, these results indicate that an shRNA expression system can be 
used (i) to reproducibly reduce expression of selected gene products in selected 
tissues in vivo and (ii) to study the effects of gene silencing in a tissue-specific 
manner. 

Efficiency ofRNAi vanes between cell-types and developmental stages 

Although RISC-stabilized siAl expression was observed in all 
lymphocyte subsets, the efficiency of Al mRNA knock-down did not appear to 
be the same in all tissues. For example, lanes 3 and 4 of Figure 3 A show that 
U6A-shAl mouse-derived splenic B cells express approximately threefold less 
siAl than U6A-shAl -derived thymocytes. A similar three-fold decrease of siAl 
expression was observed in U6A-shAl T cells (data not shown). 

To quantify Al mRNA, a real time RT-PCR strategy was designed to 
detect all three isoforms of Al. Total RNA was subjected to quantitative PCR 
analysis using SYBR GREEN PCR core reagents (Applied Biosystems) as 
described by the manufacturer and the iCycler iQ Real-Time PCR Detection 
System (Bio-Rad). Al was amplified using primers Al-s 
(CATTAACTGGGGAAGGATTGTGAC) and Al-as 
(GCAGAAAAGTCAGCCAGCCAGATT). L32 ribosomal RNA served as 
internal reference and was amplified using the primer pair L32B-S 
(CAAGAGGGAGAGCAAGCCTA) and L32B-as 
(CGTCTCAGGCCTTCAGTGAG). 

Figure 4A shows Al mRNA normalized to ribosomal RNA L32 in the 
indicated cell lines from indicated transgenic animals (wild-type (light grey), 
U6A-shAl (black), and U6-STOP-shAl/CD19-cre mice (dark grey)). U6A- 
shAl mice reveal efficient RNAi-mediated knock down of Al in thymocytes 
(-16% of Al wild-type mRNA levels), but significantly less efficient RNAi- 
mediated knock down of Al in splenic T and B cells (33% and 47% of wild-type 
mRNA levels, respectively). 

Figure 4B shows that despite variations in knock-down efficiencies 
between mice, the same characteristic tissue-specific expression efficiency 
pattern was observed in every animal. Circles represent multiple U6-STOP- 

36 



WO 2005/070044 PCT/US2005/003104 

shAl; CD19-cre mice, squares, triangles and diamonds represent individual 
U6A-shAl mice. Grey bars show the average % of wild-type Al mRNA levels. 
Furthermore, no significant differences were observed in the distribution of 
lymphocyte subsets between U6A-shAl mice and wild-type littermates (data not 
shown), suggesting that the U6A-shAl allele does not affect lymphocyte 
development. 

In U6-STOP-shAl ; CD19-cre animals, knockdown is restricted to B 
lineage cells but is less efficient than in splenic B cells from U6A-shAl mice. 
This can be partially explained by the fact that -10% of splenic CD19-cre B 
cells still carry the STOP cassette and hence do not downregulate Al (see Figure 
3C). In addition, cells that have only recently deleted the STOP cassette may not 
yet have reached maximal knock-down efficiency. 

The data indicate that siAl -mediated reduction of Al expression is less 
efficient in mature lymphocytes, particularly in B and T-cells, and that this 
phenomenon is not due to a lack of RISC protected siRNA, as shown by Figures 
3A. 

Cell-type specific differences in RNAi are not solely attributable to the 
siRNA:mRNA ratio 

The observation that thymocytes undergo RNAi more efficiently than 
lymphocytes is consistent with the observation that thymocytes express more 
siAl molecules per Al mRNA than mature lymphocytes. To test whether this is 
the sole cause for cell-type specific differences in RNAi efficiency, we analyzed 
the knock down efficiency in U6A-shAl thymocytes that were stimulated in 
vitro through TCR cross-linking, since TCR signaling is known to induce Al 
mRNA expression. Figure 5A shows the results of real time RT-PCR analysis of 
Al mRNA levels (normalized to L32 mRNA) in stimulated (CD3/CD28) and 
unstimulated (ex vivo) thymocytes from U6A-shAl (black), wild-type (dark 
grey), and U6-STOP-shAl/CD7P-cre mice (light grey). Figure 5B shows 
Northern blot for siAl in equal total RNA samples from stimulated (CD3/CD28) 
and unstimulated (ex vivo) thymocytes from the indicated transgenic mouse 
strain. 

Figure 5A shows a threefold increase in Al expression 6 hours after TCR 
cross-linking in wild-type thymocytes, yielding Al mRNA levels equivalent to 
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those in splenic B cells. However, Figure 5A shows that in the presence of the 
U6A-shAl transgene, Al levels were reduced by 80% both in stimulated and 
unstimulated thymocytes. Given the significant increase in Al mRNA 
expression relative to L32 mRNA and the decrease in siAl expression relative to 
total RNA upon activation (shown in CD3/CD28 lane of Figure 5B), the data 
indicate that each siAl molecule degraded at least three-fold more Al mRNA 
molecules in stimulated thymocytes relative to unstimulated thymocytes. This 
result suggests that the efficiency of RNAi is not simply a function of siAl:Al 
mRNA stoichiometry. 

To investigate whether this phenomenon extended to other cell types, Al 
mRNA levels and siAl levels were both normalized to the L32 mRNA, allowing 
us to compare siAl :A1 mRNA ratios between the different cell types. The real 
time RT-PCR strategy described above was used to analyze mRNA levels. 
Figure 6 A shows that similar amounts of siRNA and Al mRNA per total RNA 
were detected in splenic B cells from U6-STOP-shAl/CDiP-cre mice, and in ES 
cells carrying the Al-IRES-GFP and U6-STOP-shAl transgenes. However, Al 
levels are dramatically reduced only in EGFP low ES cells. 

Figure 6B shows that bone marrow-derived macrophages (BMDM) from 
U6A-shAl mice undergo RNAi 2-3 fold more efficiently than splenic B cells 
from the same animal. This efficiency increase is observed despite the fact that 
macrophages express greater than tenfold more Al mRNA as shown in Figure 
6C. 

Figure 6D shows a Northern blot indicating that equal amounts of RNA 
from B cells and BMDM contain a comparable number of RISC protected siAl 
molecules. Thus BMDM a greater thanl 0-fold decrease in siRNA:mRNA ratio 
relative to B cells. 

Taken together, the findings in Figure 6A-6D show that the efficiency of 
RNAi in vivo varies between tissues in a manner that is not solely determined by 
the ration of siRNA to target mRNA. 

Inefficient RNAi in Mature T cells is not Restricted to siAl 

To confirm that inefficient siRNA target knock-down in mature 
lymphocytes is not specific to siAl, retroviral gene transfer was used to integrate 
unrelated shRNA expression cassettes into the genome of ex vivo isolated splenic 
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CD4 T cells. E3 ligases Itch and Nedd-4 were chosen as shRNA target genes, 
since reduction in their expression levels does not negatively affect T cell 
proliferation or survival. Target protein expression was evaluated by Western 
blots shown in Figure 7 using anti Itch antibody (184520, BD Transduction Labs, 
Lexington, Kentucky) at 1:1000 and anti Nedd4 antibody (07-049, Upstate 
Biotechnologies, Lake Placid, NY) at 1:20,000 and visualized using 
chemoilluminiscence plus kit (Perkin Elmer). 

The left panels of Figure 7 show target protein levels of NIH-3T3 
fibroblasts retrovirally transduced with Itch-specific (si-itch) and Nedd-4- 
specific (si-nedd4) siRNAs. Under the same transfection conditions, siRNAs 
mediated greater than 10-fold knock-down of Itch and Nedd-4, respectively. On 
the other hand, the right panel of Figure 7 shows that target gene expression was 
reduced by only -50% in CD4 T-cells, despite the presence of processed siRNA 
molecules (data not shown). 

These results indicate that mature lymphocytes are generally impaired in 
their ability to undergo RNAi. 

OTHER EMBODIMENTS 

It is to be understood that while the invention has been described in 
conjunction with the detailed description thereof, the foregoing description is 
intended to illustrate and not limit the scope of the invention, which is defined by 
the scope of the appended claims. Other aspects, advantages, and modifications are • 
within the scope of the following claims. 



39 



